Learning with options: Just deliberate and relax
نویسنده
چکیده
Bounded rationality is a very important framework for understanding rationality in both natural and artificial systems. In this paper, we aim to bridge the gap between bounded rationality and reinforcement learning, which has also proven very fruitful in both types of intelligent systems. A lot of reinforcement learning work has focused on Markov Decision Processes, where optimal policies can be obtained under certain assumptions. However, optimality does not take into account possible resource limitations of the agent, which is assumed to have access to a lot of data and computation time. One approach for reducing the costs of such agents has been to provide them with temporally extended actions and models, which allow policies to be computed faster, cf. [Dietterich, 2000; Precup, 2000]. However, the problem of automatically finding good temporal abstractions has proven very difficult. Part of this difficulty stems from the fact that from the point of view of absolute optimality, temporal abstractions are not necessary: the optimal policy is achieved by primitive actions. Therefore, it has been difficult to formalize in what precise theoretical sense temporally abstract actions are helpful.
منابع مشابه
Designing reading tasks to maximise vocabulary learning
Most vocabulary learning should occur incidentally through listening and reading. This is one of the reasons why a substantial extensive reading program is an important part of an English course. Extensive reading requires the learners to do large quantities of reading using material that is at the right level for them. Vocabulary learning occurs through th...
متن کاملLearning Martingale Measures to Price Options
We provide a framework for learning risk-neutral measures (Martingale measures) for pricing options. In a simple geometric Brownian motion model, the price volatility, fixed interest rate and a no-arbitrage condition suffice to determine a unique risk-neutral measure. On the other hand, in our framework, we relax some of these assumptions to obtain a class of allowable risk-neutral measures. We...
متن کاملFactors Affecting Deliberate Learning in First Year Students of Nursing and Midwifery School of Isfahan University of Medical Sciences
Introduction: Medical related professions require continuous deliberated learning in order to promote medical services. It is optimal that students start their professional education with adequate understanding and ability for deliberated learning. This study was conducted to assess the validity and reliability of the questionnaire designed for assessing deliberated learning. Methods: This is ...
متن کاملLearning Martingale Measures From High Frequency Financial Data to Help Option Pricing
We provide a framework for learning risk-neutral measures (Martingale measures) for pricing options from high frequency financial data. In a simple geometric Brownian motion model, a price volatility, a fixed interest rate and a no-arbitrage condition suffice to determine a unique risk-neutral measure. On the other hand, in our framework, we relax some of these assumptions to obtain a class of ...
متن کاملComment on “Simulation-Based Mastery Learning with Deliberate Practice Improves Clinical Performance in Spinal Anesthesia”
Udani et al. have presented the results of a straightforward study on the effect of simulation-basedmastery learning with deliberate practice on clinical performance in spinal anesthesia [1]. And the results of the study are equally straightforward: simulation-based mastery learning with deliberate practice appears to improve the skills of learners. However, there are a number of reasons to be ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015